翻訳と辞書 |
Spoken English Corpus : ウィキペディア英語版 | Spoken English Corpus The Spoken English Corpus (SEC) is a speech corpus used in corpus linguistics consisting of a collection of recordings of spoken British English compiled during the period 1984-7 through a collaboration, funded by IBM, between the Unit for Computer Research on the English Language (UCREL) at the University of Lancaster and the IBM Scientific Centre in Winchester.〔 The corpus comprises 53 recorded passages, mainly recorded from the BBC, spoken in the accent usually referred to as Received Pronunciation, or RP. It covers categories such as commentary. news broadcast, lecture and dialogue.〔 The corpus contains 52,637 words, in a recording time of 339 minutes. The compilation of the corpus is described by Lita Taylor in her 1996 article "The Compilation of the Spoken English Corpus."〔 == Transcription of the recordings ==
A system was devised for transcription of the intonation of the material in the recordings, and two transcribers, Gerry Knowles and Briony Williams, analysed the entire corpus. The transcription system is explained by Williams,〔 and an experiment was conducted by Brian Pickering to assess the degree of agreement between the two transcribers on a section of the Corpus containing around 1000 tone-units which was transcribed by both transcribers.〔 Good agreement was found.
抄文引用元・出典: フリー百科事典『 ウィキペディア(Wikipedia)』 ■ウィキペディアで「Spoken English Corpus」の詳細全文を読む
スポンサード リンク
翻訳と辞書 : 翻訳のためのインターネットリソース |
Copyright(C) kotoba.ne.jp 1997-2016. All Rights Reserved.
|
|